Identification of cromosomal translocation hotspots via scan statistics
نویسندگان
چکیده
Motivation: The detection of genomic regions unusually rich in a given pattern is an important undertaking in the analysis of next generation sequencing data. Recent studies of chromosomal translocations in activated B lymphocytes have identified regions that are frequently translocated to c-myc oncogene. A quantitative method for the identification of translocation hotspots was crucial to this study. Here we improve this analysis by using a simple probabilistic model and the framework provided by scan statistics to define the number and location of translocation breakpoint hotspots. A key feature of our method is that it provides a global chromosome-wide significance level to clustering, as opposed to previous methods based on local criteria. Whilst being motivated by a specific application, the detection of unusual clusters is a widespread problem in bioinformatics. We expect our method to be useful in the analysis of data from other experimental approaches such as of ChIP-seq and 4C-seq. Results: The analysis of translocations from B lymphocytes with the method described here reveals the presence of longer hotspots when compared to those defined previously. Further, we show that the hotspot size changes quite substantially in the absence of DNA repair protein 53BP1. When 53BP1 deficiency is combined with overexpression of activation induced cytidine deaminase (AID) the hotspot length increases even further. These changes are not detected by previous methods that use local significance criteria for clustering. Our method is also able to identify several exclusive translocation hotspots located in genes of known tumor supressors. Availability: The detection of translocation hotspots is done with hot scan, a program implemented in R and Perl. Source code and documentation are freely available for download at https://github.com/itojal/hot scan. Contact: [email protected] Date: October 9, 2013. 2010 Mathematics Subject Classification. Primary: 92D20, Secondary: 62P10, 62M30.
منابع مشابه
Identification of chromosomal translocation hotspots via scan statistics
MOTIVATION The detection of genomic regions unusually rich in a given pattern is an important undertaking in the analysis of next-generation sequencing data. Recent studies of chromosomal translocations in activated B lymphocytes have identified regions that are frequently translocated to c-myc oncogene. A quantitative method for the identification of translocation hotspots was crucial to this ...
متن کاملIdentifying at Highway-Rail Grade Crossing Hotspots in Canada
This research presents a risk-based Hotspots identification model at highway-rail grade crossings in Canada. Two sets of models were developed to predict collision frequency and consequence at individual crossings. A two–dimensional graphic approach was adopted to combine these two models together to predict the risk at each crossing. Hotspots based on collision history tended to be widespread ...
متن کاملGeographic analysis of forest health indicators using spatial scan statistics.
Geographically explicit analysis tools are needed to assess forest health indicators that are measured over large regions. Spatial scan statistics can be used to detect spatial or spatiotemporal clusters of forests representing hotspots of extreme indicator values. This paper demonstrates the approach through analyses of forest fragmentation indicators in the southeastern United States and inse...
متن کاملSpatial analysis of influenza incidence in EMRO using flexible scan statistics
Introduction: Influenza is an infectious and severe respiratory disease. It is one of the major problems of public health. In order to determine the spatial distribution and areas with over-expected of a disease including influenza, it can be effective in identifying environmental hazards and fair distribution of health services. In this study, the geographical distribution of the influenza and...
متن کاملUnderstanding the Mechanism Underlie the Antidiabetic Activity of Oleuropein Using Ex-Vivo Approach
Background: Oleuropein, the main constituent of olive fruit and leaves, has been reported to protect against insulin resistance and diabetes. While many experimental investigations have examined the mechanisms by which oleuropein improves insulin resistance and diabetes, much of these investigations have been carried out in either muscle cell lines or in vivo models two scenarios with many draw...
متن کامل